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(57) Abstract: Alignment accuracy beiween two or more panemed layers is measured using a metrology target (10, 20) oompiising 
sabstantially overlapping difiraction gratings (30. 32; 81, 83; 120. 122; 212a-b. 214a-b; 22U; 230a<; 254; 256) formed in a test area 
u( (be layen being tested. An opiical instiuzoenl (40) illuminates all ur purl uf the Uiigel van and measures (he optical response. 
The instrument can measure transmissaon. reflectance, and/or ellipsometric parameteis as a function of wovelenglh. polar'angle 
of incidence, a7.imutha1 angle of incidence, and/or polariradon of the illnminatinn and detected lighL Overlay error or offset (A) 
between those layers containing the test gratings is detennined by a processor programmed (Figs. 19-20) co calculate an opiical 
response for a set of parameters thai include overlay cnoi; using a model that accounts for dii&actioD by the gratings and interaction 
uT the gratings with each othen' diflhtcted Geld. The model parameters mighl also talce aouounl uf nmnutactuied usyminetiies. 
The calculation may imrohre inteipolation of pro-computed entries from a database accessible to the processor. The calculated and 
measured lesponses ate iteradvely cornpared and the model parameters changed to miniinizc the difference. 
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• De^<:ription 

OVEiO^Ay ALIGNMENT ME^OIiOOY 
DSINO DIFFRACTION GERATlNGS 

TECHNICAL FIEliD 

This invention relates to measuring the pattern 
overlay alignmeiiit accuracy of a pair of patterned layers 
on a semiconductor wafer, poissibly separated by one or 
more layers, made by two or more lithography steps during 
the manufacture of semiconductor devices. 

BACKGROUND ART 

Mtoufacturing semiconductor devices involves 
depositing and patterning several layers overlaying each 
other. For exanple, gate interconiiects and gates of a 
CMOS integrated circuit have layers with different pat- 
terns^ which are produced by different lithography 
stages. The tolerance of alignment of the patterns, at 
eaich of the66 layers caii be sinaller than the width of the 
gate. At the time of this writing, the smallest 
linewidth that can be mass produced: is 130 nm. The state 
of the art mean +30 alignment accuracy is 30 nm (Nikon 
KrF Step-and-Repeat . Scanning System NSR-S205C, July 
2000) . 

Overlay metrology is the art of checking the 
quality of alignment after lithography. Overlay error is 
defined as the offset between two patterned layers from 
their ideal relative position. Overlay error is a vector 
q:uantity with two components in the plane of the wafer. 
Perfect overlay and zero overlay error are used synony- 
mously. Depending on the context, overlay error may 
signify one of the coitqponents or the magnitude of the 
vector . 
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Overlay metrology saves subsequent process 
stQps that would be built oh a faulty foundation in case 
df an alignment error. Overlay metrology provides the 
iAfdrmation that is necessary to correct the alignment of 
5 the stepper-scanner and thereby minimize overlay error on 
subsequent wafers. Moreover, overlay errors detected on 
a given wafer after exposing and developing the 
photoresist can be corrected by removing the photoresist 
and repeating the lithography step on a corrected 
10 stepper -scanner. If the measured error is xidnor, parame- 
ters for Subsequent steps of the lithography process 
cduld be adjusted based on the overlay metrology to avoid 
excursions, if overlay error is measured subsequently/ 
e.g., after the etch step that typically follows develop* 
15 it can be used to * scrap'' severely mis-processed wafers, 
or to adjust process equipment for better pet'formance on 
subsequent wafers. 

Prior overlay metrology miethods use built-in 
test patterns etched or otherwise formed into or on the 
20 various layers during the same plurality of lithography 
steps that form the patterns for circuit elements on the 
wafer. One typical pattern, called ""box-in-box" consists 
of two concentric squares, formed on a lower and an upper 
layer, respectively. *Bar-in-bar' is a similar pattern 
25 with just the edges of the ^'boxes' demarcated, and broken 
into disjoint line segments, as shown in Figure 1. The 
outer bars 2 are associated with oiie layer and the inner 
bars 4 with another. Typically one is the upper pattern 
and the other is the lower pattern, e.g.« outer bars 2 on 
30 a lower layer, and inner bars 4 on the top. However, 

not truly planar so the designations "^upper" and lower' 
are ambiguous. IVpically they correspond to earlier and 
later in the process. There are other patterns used for 
35 overlay metrology.* The squares or bars are formed by 
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lithographic and other processes used to make planar 
Btructxiires, e.g., chemical-mechanical planarizatioh 
<CMP) . Currently, the patterns for the boxes or bars are 
stored on lithography masks and projected onto the wafer. 
Other methods for putting the patterns oln the wafer are 
possible, e.g., direct electron beam- writing from com- 
puter memory, etc. 

In one form of the prior art, a high perfor- 
mance microscope imaging syst«i combined with image pro-- 
cessing software estimates overlay error for the two 
layers. The image processing software uses the intensity 
of light at a multitude of pixels. Obtaining the overlay 
error accurately requires a high quality imaging system 
and means of focusing it. Some- of this prior art is 
reviewed by the article •Semiconductor Pattern Overlay', 
by Neal T. Sullivan, Handbook of Critical Dimension Me-, 
trology and Process Control: Proceedings of Conference 
held 28-29 September 1993, Monterey, California, Kevin M. 
Monahan, ed,, SPIE Optical Engineering Press, vol. CR52, 
pp. 160-188. A. Starikov, D.J. Coleman, P.J. Larson. 
A.D. Lapata, W. A. Muth, in '^Accuracy of Overlay Measure- 
ments: tool and Mark Asymmetry Effects,' Optical Engi- 
neering, vol. 31, 1992, p. 1298, teach measuring overlay 
at one orientation, rotating the wafer by 180^, measuring 
overlay again and attributing the difference to tool 
errors and overlay mark asymmetry. 

One requirement for the optical system is very 
stable positioning of the optical system with respect to 
the sample. Relative vibration would blur the image and 
degrade the performance. This is a difficult requirement 
to meet for overlay metrology systems that are integrated 
into a process tool, like a lithography track. The tool 
causes potentially large accelerations (vibrations), 
e.g., due to high acceleration wafer handlers. The tight 
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space requireinents for integration preclude bull^ isola- 
tion strategies. 

The imaging-based overlay measurement precision 
can be two orders of magnitude smaller than the WaVe* 
5 length of the light used to image the target patterns of 
concentric boxes or beurs. At such small length scales* 
the image does not have well determined edges because of 
diffraction. The determination of the edge, and there- 
fore the overlay measurement, is affected by any factor 
that ch^ges the diffraction pattern. Chemidal-mechani- 
cal planarization (CMP) is a commonly used technique used* 
to planarize the wafer surface at intermediate process 
st^s before depositing more material. CMP can render 
the profile of the trenches or lines that make up the 
overlay measurement targets asymmetric. Figure 2 illus- 
trates an overlay target feature 2 which is a trench 
filled with metal. Surface 3 is planarized by CMP. The 
CMP process erodes the surface, of the overlay mark 2 in 
an asymmetric maimer. The overlay target 2 is conpared 
subsequently to target feature 4 in the overlying layer, 
which could be, e.g., photoresist of the next lithography 
step. The asymmetry in target feature 2 changes the 
diffraction pattern, thus potentially causing an overlay 
measurement error. 

In U.S. Patent No. 4,757,207, Chappelow, et al. 
teach obtaining the quantitative value of the overlay 
offset from the reflectance of targets that consists of 
identical line gratings that are overlaid upon each other 
on a planar siabstrate. Each period of the target consists 
of four types of film stacks: lines of the lower grating 
overlapping with the spaces of the. upper grating, spaces 
of the lower grating overlapping with the lines of the 
upper grating, lines of the lower and upper gratings 
overlapping, spaces of the lower and u^^er gratings over- 
lapping. Chappelow et al. approximate the reflectance of 
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the overlapping gratings as the average of the 
reflectances of the four film stacks weighted by their 
area-T fractions, ibis approxiination, which neglects 
diffraction, has sotte validity when the lin6s and spaces 
5 are larger than largest wavelength of the reflectometer. 
"The reflectance of each of the four film stacks is mea- 
sured at a so called xnacrb-slte close to the overlay 
target. Each macro-site has a uniform film stack over a 
region that is larger than the measuf Orient spot of the 

10 reflectometer. A limitation of 4,757,207 is that spatial 
variations in the film thickness that are caused by CMP 
and- resist loss during lithography will cause erroneous 
crverlay measurem^ts. Another limitation of 4,757,207 is 
that reflectance is measured at eight sites in one over- 

15 lay metrology target, which increases the size of the 
target and decreases the throughput of the measurement. 
""Another limitation of 4,757,207 is that the lines and 
spaces need to be large cornpared to the wavelength, but 
small compared to the measurement spot which limits the 

20 accuracy aiid precision of the measurement. Another limi- 
tation of 4,757,207 is that the light intensity is mea- 
sured Yyy a single photodiode. The dependence of the 
optical properties of the sample is not measured as a 
function of wavelength, or angle of incidence, or polar- 

25 ization, which limits the precision of the measurement. 

The '^average reflectivity ajpproximation for 
the interaction of light with grating^, as employed by 
U.S. Patent No. 4,757,207, greatly simplifies the problem 
of light interaction with a grating but neglects much of 

30 the diffraction physics. The model used to interpret the 
data" hasr"**f our distinct regions whose respective 
reflectivities are determined by the combination of lay- 
ers formed l:>y the sxibstrate and the overlaid patterns and 
by the respective materials in the substrate and pat- 

35 terns.' Eq. 1 in the patent clearly indicates that these 
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regions do not interact, i.e., via diffraction, as the 
total reflectivity of the structure is a single average 
of the foiir reflectivities with area weighting. 

IBM Technical Disclosure Bulletin 9 OA 60854 / 
GB8880210, M&rch 1990, pp 170-174, teaches, measuring 
offset between two patterned layers hy overlapping grat- 
ings. There are four sets of overlapping gratings to 
measure the x-offset and another four sets of overlapping 
gratings to measure the y-offset. The four sets of grat- 
ings, which are measured by. a spectroscopic 
reflectometer, have offset biases of 0, %, ?4-pitch. 
The spectra are differenced as Sa = S0-S%, Sb = S% -Si4; a 
weighted average of the difference spectra is evaluated: 
la = <w,Sa>, lb = <w,Sb>, where w is a weighting func- 
tion; and the ratio min(Ia,lbj /max(Ia, lb) is used to look 
up the offset/pitch ratio from a table. G£8d80210 relies 
on *well known film thickness algorithms* to model the 
optical interactions. Such algorithms treat the electro- 
magnetic boundary conditions at the interfaces between 
the planar layers or films. If the direction perpendicu- 
lar to the films is the z direction, the 33oundaries be- 
tween the films are at constant z^z„, where z„ is the 
location of the nth boundary. Such Algorithms, and hence 
GE880210, do not use a model that accounts for the dif- 
fraction of light by the gratings or the multiple scat- 
tering of the light by the two gratings, and it has no 
provision to handle non-rectangular line profiles.- 

In U.S. Patent No, 6,150,231, Muller et al. 
teach measuring overlay by Moir6 patterns. The Moir6 
pattern is formed by overlapping gratings patterns, one 
grating on the lower level, another <mi the upper level. 
The two grating patterns have different pitches. The 
Moir6 pattern approach requires imaging the overlapping 
gratings and estimating their offset from the spatial 
characteristics of the image. 
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In U.S. Patent Nos. 6.023,338 and 6,079,256, 
Bareket teaches an alternative approach in which two 
cdniplementary periodic grating structures are produced on 
die two subsequent layers that require alignment, /ftie 
5 two periodic structures are arranged adjacent to and in 
fixed positions relative to one another, such that there 
is no overlap of the two structures, rfhe two gratings 
.are sc&nned, either optically or with a stylus, so as to 

0 

detect the individual undulations of the gratings as a 

10 function of position. The overlay error is obtained from 
the spatial phase shift between the undulations of the 
two gratings. 

Sndth et al. in U.S. Patent Ito. 4,200,395, and 
Ono in U.S. Patent No. 4,332,473 teach aligning a wafer 

15 and a iktask by using overlapping diffraction gjratings and 
measuring higher order, i.e., non- specular, diffracted 
light. One diffraction grating is on the wafer and an- 
.other one is on the mask. The overlapping gratings are 
ill\iminated by a normally incident light and the intensi- 

20 ties of the positive and negative diffracted orders, e.g. 
1*^ and -l*^ orders, are compared. The difference between 
the intensities of the 1" and -1" diffracted orders 
provides a feedback signal which can be used to align the 
wafer and the mask. These inventions are simileir to the 

25 present one in that they use overlapping gratings on two 
layers. However, the 4,200,395 and 4,332,473 patents are 
. applicable to mask alignment but not to overlay metrol- 
ogy. They do not teach how to obtain the quantitative 
value of the offset from the light intensity measure- 

30 ments. 4,200.395 and 4,332,473 are not applicable to a 
measurement system that only uses specular, i.e., zeroth- 
order diffracted light . 

This invention is distinct from the prior art 
in that it teaches measuring overlay by scat terome try. 

35 Measurements of structural parameters of a diffracting 
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structure from optical characterization are now well 
known in the art as scat terome try. With such methods, a 
measuremeht saingple is illxaminated with optical radiation, 
and the sasple properties are determined by measuring 
5 characteristics of the scattered radiation (e.g., inten- 
sity, phase, polarization state, or angular distribur 
tion) . A diffracting structure consists of one or more 
layers that may have lateral structure within the illund- 
nated and detected area, resulting in diffraction of the 
10 reflected (or transmitted) radiation. If the lateral 
structure dimensions are smaller than the illuminating 
wavelengths, then diffracted orders other than the zeroth 
order may all be evanescent and not directly ob&ervable. 
But the structure geometry can nevertheless significantly 
15 affect the zeroth-order reflection, making it possible to 
make optical measurements of structural features much 
smaller than the illuminating wavelengths. 

In one type of measurement process, a 
microstructure is illuminated and the intensity of re- 
20 fleeted or diffracted radiation is detected as a function 
of the radiation's wavelength, the incidence direction, 
the collection direction, or polarization state (oz: a 
combination of such factors). Direction ^is typically 
specified as a polar angle and azimuth, where the refer- 
25 ence for the polar angle is the normal to the wafer and 
the reference for the azimuth is either some pattern (s) 
on the wafer or other marker, e.g., a notch or a flat for 
silicon wafers. The measured intensity data is then 
passed to a data processing machine that uses some model 
of the scattering from possible structures on the wafer. 
_For example, the model, may eitploy Maxwell's equations to 
calculate the theoretical optical characteristics as a 
ftmction of measurement parameters (e.g., film thiclcness, 
line width, etc.), and the parameters are adjusted until 
the measured and theoretical intensities agree within 
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specified convergence criteria. The initial parameter 
estimates may be provided in terms of an initial "seed' 
ittodel. of the measured structure. Alternatively, the 
optical model may exist as pre-con^ted theoretical char- 
5 acteristics as a function of one or nfore disctetized 

measurement parameters, i.e., a "library', that adsbci- 
ates collections of parametets with theoretical optical 
characteristics. The ^'extracted' structural model has 
the structural paraiiieters associated with the optical 
model which best fits the measured characteristics, e.g., 
in a least-sguares sense. 

Conrad (U.S. Patent No. 5,963,329) is an exam- 
ple of the application of scatteroinetry to measure the 
line profile or topographical cross^sections. The direct 
application of Maxwell *s equations to diffracting struc- 
tures, in contrast to non-diffracting structures (e.g., 
unpattemed films), is much more complex and time-consxim- 
ing, possibly resulting in either a considerable time 
delay between data acquisition and result reporting 
and/br the need to use a physical model of the profile 
which is veiry sin5>le and possibly neglects significant 
features. 

Scheiner et al. (U.S. Patent No. 6,100,985) 
teaches a measurement method that is similar to that of 
tionrad, excerpt that Schelner's method uses a sinqplified, 
approximate optical model of the diffracting structure 
that does not involve direct numerical solution of 
Maxwell's equations. This avoids the con^lexity and 
calculation time of the direct numerical solution. How- 
ever, the approximations inherent in the simplified model 
make it inadequate for grating structures that have pe- 
riod and linewidth dimensions comparable to or smaller 
than the illumination wavelengths. 

In an alternative method taught by McNeil et 
al. (U. S. Patent No. 5,867,276) the calculation time 
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delay is substantially reduced by storing a multivariate 
statistical analysis model based on calibration data from 
a range of model structures. The calibration data may 
come from the application of Maxwell's equatiotis to 
5 parameterized models of the structure. The statistical 
analysis, e.g., as taught in chemometrics, is applied to 
the measured diffraction characteristics and returns 
estimates of the parameters for the actual structure, 
The measurement method taught by McNeil uses 
10 diffraction characteristics consisting of spectroscopic 
intensity data. A similar method can also be used with 
fellipsometric data, using ellipsometric parameters such 
as tan v, cos A in lieu of intensity data, fox exanqple, 
Xinhui Niu in 'Specular Spectroscopic Scatterometry in 
15 DUV Lithography, " Proc. SPIE, vol. 3677, pp, 159-168, 

1999, uses a library approach. The library method can be 
used to simultaneously measure multiple model parameters 
(e.g. linewidth, edge slope, film thickness). 

In International (PCT) application publication 
20 no. WO 99/45340 (KLA-Tencor), Xu et al. disclose a method 
for measuring the parameters of a dif firacting structure 
on top of laterally homogeneous, non-diffracting films. 
The disclosed method first constructs a reference data- 
base based on a priori information about the refractive 
25 index and film thickness of underlying films, e.g., from 
spectroscopic ellipsometry or ref lectometry. The "refer- 
ence database" has -diffracted light fingerprints' or 
•signatures- (either diffraction intensities, or alterna- 
tively ellipscHnetric parameters) corresponding to various 
30 combinations of grating shape parameters. The grating 
shape-parameters associated-with the- signature in the 
reference database that matches the measured signature of 
the structure are then r^orted as the grating shape 
parameters of the structure. 



35 
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An imbounded periodic structure is one that is 
invariant under a nonzero translation in a direction wihen 
there exists a miniihum positive invariemt translatiofi in 

• 5 the said direction. Here we are concerned with struc- 
tures that are periodic in directions (substantially) 
parallel to the surface of a wafer. Here ^ wafer' is used 
to mean any manufactured object that is built by building 
up patterned, overlying layers. Silicon wafers for mi- 

10 croelectronics are a good exanple, and there are many 
others, e.g., flat panel displays. 

A one-dimensional (ID) periodic structure has 
one direction in which it is invariant for any transla- 
tion. The lattice dimension is perpendicular to the 

15 invariant direction. The smallest distance of transla- 
tion albng the lattice dimension which yields invariance 
is the pitch of the grating. Two-dimensional gratdLngs 
are also possible, with two lattice directions and 
pitches, as is well known. In this application, a peri- 

20 odic structure is understood to be a portion of an un- 
bounded periodic structure. The periodic structure is 
understood to extend by more than one period along its 
lattice axes. A grating is a periodic structure. A 
diffraction grating is a grating used in a manner to 

25 interact with waves, in pcurticular light waves. A ID 
grating is also referred to as a *line grating". 

Upon reflection by or transmission through a 
diffraction grating, light propagates in discrete direc-. 
tions called Bragg orders. For a particular Bragg order 

30 in, the component of the wavevector along the lattice 
-axis, kj^y differs from the same component of the 
wavevector of the incident wave by an integer multiple of 
the lattice wavenuxnber For a line grating. 
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+ — — ^ m=a±l.±2... 

PA 

where A and 0j are the wavelength and angle of the inci- 
dent wave in vacuum (or something effectively like vac- 
uum, e.g., air), n is the refractive index of the trans- 
parent medium that separates the two gratings . P is the 
pitch of the grating. The x-axis is the lattice axis and 
the z-axis is petpendicular to the plane of the wafer. 
The Bragg orders are referenced by the integer a. The 
Bragg orders for which V<0 are called evanescent, non- 
propagating, or cut-off. The evanescent Bragg orders 
havejpure imaginary wavexiumbers in thte z direction. 
Hence, they exponentially decay as exp(-| Im(Jc.)| z) as a 
function of the distance z, measured from the diffraction 
grating along the z-axis . 

The polar angle ^ and azimuth j are defined as 

shown in Figure 3, with respect to the lateral or in- 
plane directions x and y, and the vertical or out of 
plane direction 2. The figure applies generally to ob- 
jects that are substantially planar, or locally to curved 
objects. The orientation of the lateral directions x and 
y may correspond to physical features on the wafer, e.g. 
structures 5 deposited or formed on the wafer (sub- 
strate), or actually part of the substtate, e.g., a wafer 
notch. 

The spot of an optical instrument is the region 
on a sample whose optical characteristics are detected by 
the instrument. The measurement system can translate the 
location of the spot on the sample, and focus it, as is 
well known in the art. 



wo 02/0(5545 PCTaJS02/04190 

-13- 

DISCLOSURE OF INVElfriON 

The present invention measures the overl^iy 
error of layers on a wafer with low- resolution optics. 
The basic overlay metrology target used in the present 
5 invention coitprises a pair of overlapping diffraction 
gratings, i.e., a lower grating on a lower (or earlier 
formed) layer and an upper (or later formed) grating. 
The spot of the optical instrument preferably covers many 
periods of the gratings and it does not necessarily re- 

10 solve the lines of the grating. The overlay error is 
measiired by scatterometry, the measurement of optical 
characteristics, siich as reflectance or ellipsometric 
parameters, as functions of one or more independent vari- 
ables, e;g., wavelength, polar or azimuthal angles of 

15 incidence or collection, polarization, or some cozobina- 
tion thereof. 

It is an object of the present invention to use 
scatterometry to accurately measure overlay error. It is 
also an object of the invention that thi^ accurate over- 

20 lay measurement be obtained even when the profile of the 
grating lines has been altered or rendered asymmetric by 
a process such as chemical -mechetnical planar ization. An 
instrument meeting these objectives has utility in stan- 
dard planar/photo-lithographic technology used for micro- 

25 electronics manufacture, as well as other technologies 

using multiple patterned layers. This has the advantage 
that the sarnie measurement hardware used for other optical 
measurements, e.g., line profiles or film thicknesses, 
can be used for another critical measurement, that of 

30 overlay. 

The method includes the steps of laying down a 
first grating during a first step of manufacturing (mak- 
ing) a planar structure, laying down a second grating 
during a second manufacturing step so that the second 
35 grating substantially overlaps the first grating (later- 
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ally, in x and y) , then illuminating at least a portion 
of the region of overlap* detecting radiation that has 
interacted with both gratings, and inverting for the 
offset between the gratings as a parameter of a model. 
5 The critical dimension (CD) and line profile also may be 
measured, simultaneously or with additional, similar 
measuring and data processing st^s. 

It is another object of the present invention 
to describe an apparatus for practicing the above method. 
The apparatus comprises an instrument receiving a sample 
and including a source of illumination and a detector 
that detects light which has interacted with the sample. 
The sanqple coirtjrises a first grating fabricated at one 
stage of making a planar structure and characterized by a 
first pitch, a second grating with a second, possibly 
substantially identical, pitch that is formed during a 
second stage such that the second grating substantially 
overlaps the first grating in the lateral dimensions. 
The pitches of the gratings and the parameters of the 
instruiment are chosen such that some energy in one or 
more non-zero orders diffracted by one of the gratings 
proi>agates in the sai^ple media between the two gratings 
and reaches the other grating. The instrument is suit- 
able for also measuring CD and line profile, as well as 
the overlay measurement mentioned above. 

■ It is underistood that * optical' means employing 
one or more wavelengths of electromagnetic radiation in 
the CJV, visible, or infrared portions of the spectrum. 
It is also understood that each Bragg order has a range 
of propagation angle and a range of wavelength, given the 
nature of the instrument, e.g., numerical aperture (NA) 
and detector or source wavelength resolution. 

It is another object of the present invention 
to measure overlay error with an optical instnmient inte- 
grated into a process tool. This method and apparatus 
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overcomes the difficulties associated with vibrations 
caused by the process tool and the limited space avail- 
able for vibration dainping. The apparatus con^rises a 
process tool with at least one process chaihber and a 
5 sazople handler, an optical system in operative consmmica- 
tion With the process tool, a conqputer equipped with an 
inverse model for interaction of light between two grat- 
ings where at least one parameter of the model is a lat- 
eral offset between two gratings. 

10 It is another object of the present invention 

to measure the overlay error by coit^jaring the optical 
characteristics of grating pairs with substantially dif- 
ferent perfect-overlay offsets. This reduces the depend- 
ence of the measurements oh ancillary properties of. the 

15 sample. It also reduces the burden oil inverse scattering 
calculations. 

It is another aspect of the present invention 
to increase the range of unambiguous overlay error mea- 
surement firoin overlaying gratings. One approach is to 

20 offset symmetric gratings by one fourth of the grating 
pitch when the overlay error is zero, so that positive 
and negative overlay errors have the least ambiguity, 
regardless of the optical system. Another approach to 
extend the range of unambiguously detectable overlay 

25 errors is to make at least one of the gratings in the 

pair substantially asymmetric, that is to have the unit 
cell of its pattern asymmetric. Another approach is to 
combine a scatterometry measurement of offset with an 
imaging measurment of offset (similar to the prior art, 

30 e.g., using box-in-box) . A fourth approach is to have 
grating pairs with different pitches, preferably in a 
substantially irrational ratio, to measure the same com- 
ponent of overlay error. These four approaches may be 
used either separately or in combination to extend the 

35 range of unambiguously detectable overlay errors. 
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BRIEF DESCRIPTION OP THE DRAWINGS 

Figure 1 is a top plan view of a box«in*box 
pattern used for overlay metrology of the prior art. 

Figure 2 is a side sectional view of a wafer 
portion having the prior art overlay metrology pattern of 
Figure 1, illustrating a test pattern that has been ren- 
dered asymnetric by a planarization (CMP) process. 

Figure 3 is a perspective diagram illustrating 
the definition of angle of incidence 6, and azimuth 
angle ^ as used herein. 

Figure 4 is a diagram of the measurement in- 
strument in relation to the test patterns. 

Figure 5 is a top view of a simple first em- 
bodiment of test patterns according to the present inven- 
tion, the patterns being in the form of two sets of over- 
lapping gratings placed in an inactive area on a wafer 
for measuring respective x and y conponents of the over- 
lay. 

Figure 6 is a cross sectional view of one of 
the test patterns in Figure 5, showing the overlapjping 
diffraction gratings. 

Figure 7 is a cross sectional view like Figure 
6 exc^t that the profile of the line features of the 
lower grating have been rendered asymmetric by a 
planarization (CMP) process. 

Figures 8a-8c are side schematic views showing 
how. a grating pair with symmetric gratings gives unambig- 
uous overlay error indications over a range of one half 
the grating's period. Figure 8d is a graph of coverage 
function versus indicator offset A for the grating pairs 
in Figures 8a-8c. 

Figure 9 is a side scheinatic view of a portion 
of the grating pair of Figure 6 illustrating the configu- 
ration and dimensions used in the nximerical study in 
Figures lOa-lOd and 11. 
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Pigures 10a to lOd are graphs of reflectance 
versus wavelength when the registration error in tihe 
configuration of Figure 9 is respectively ±8nin, ±32nm, 
±64nm, and ±128nin, where the grating period in each case 
5 is 512nm. Reflectance versus wavelength for zero offset 
is used as a comparative reference curve in each of the 
graphs . 

Pigiure 11 is a graph of reflectance change per 
offset change (dR/dA) versus wavelength, i.e. spectral 
10 sensitivity to overlay error / for different grating 
pitches (256n]n^ 512nm and 1024nm) • 

Pigure 12 is a side cross sectional view of a 
test pattern of overlapping diffraction gratings, as in 
Figures 6 and 9 , except that the gratings have an asym- 
15. metric line width and spacing configuration. Preferred 
nominal dimensions for the calculation used to produce 
the graphs in Figures 14 and 15a-15k are also indicated. 

Figures 13a and 13b are side cross sectional 
views of test patterns as in Figure 12, but with respec- 
20 tive right and left overlay offsets, illustrating the 

ability to distinguish and measure small, opposite over- 
lay errors. 

Figure 14 is a graph of reflectance versus 
wavelength at noirmal incidence for the test' pattern of 
25 Figure 12 with perfect overlay alignment. 

Figures 15a to 15k are graphs of the difference 
in spectral reflectance relative to the values in Figure 
14 for overlay errors of ±lnm, ±2nm, ±5nm, ±10nm, ±20nm, 
±50nm, ilOOnin, ±200nm, ±300xun, ±400Dm, and ±500nm, respec- 
30 tively. 

Figure 16 is a graph of linear estimate of 
overlay as a function of the actual overlay. 

Figure 17 is a plan view of a quasi-one-dimen- 
sional, asymmetric grating. 



WOQ2/IKSS4S 



PCTADSOimiM 



-18- 

Figure iS is a bcheinatic side view showing 
parameters for grating lines with asyinnetric profile. 

Figures 19 and 20 are flow diagrams for twp 
methods in accord with the present invention for using 
5 the parameter's in Figure 18 to calculate the overlay 
error. 

Figure 21 is a schematic side view of an alter- 
native test pattern for differential measurement of 
alignment offset which is insensitive to geometrical and 
10 material properties of the gratings. 

Figure 22 is a top view of an alternative em- 
bodiment that uses a three-dimensional grating. 

Figure 23 shows mirrored images of the three- 
dimensional grating of. Figure 22 *diich can be used with 
15 that grating to reduce sensitivity to geometrical and 
material properties of the gratings. 

Figure 24 shows a top schematic view of a pro- 
cess tool with a metrology system suitable for practicing 
the current invention. 

Figure 25 is a cross sectional view of one of a 
test patterns where, although the material between the 
two gratings is lossy, there is sufficient physical indi- 
cation of the lower grating to affect the optical charac- 
teristics and allow the measurement of overlay. 



20 



25 



30 



35 



BEST MODE OF CARRYING OUT THE INVENTION 

Referring to Fig\ire 5, in the sinplest embodi- 
ment of the present invention, two test patterns 10 and 
20, each having a pair of overlapping gratings, are 
placed in a region on the wafer that does not interfere 
with the devices , that are being manufactured. For exam- 
ple, the test patterns can be placed on a scribe line 7 
between the dies on a wafer. Test pattern 20 is similar 
to test pattern 10 rotated by 90 degrees. Each of the 
teat patterns 10 and 20 consists of two overlying grat- 
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ings 30 and 32 diagrairanatically shown in cross section in 
Figure 6 or 7. Figure 7 differs from Figure 6 only in 
that the linie features in lower grating 30 have an asym- 
metric profile, e.g. due to a chemical-mechanical 
5 plaixarisation (CMP) process. Grating 30 is formed on the 
lower layer r i.e., at. an earlier stage of fabrication. 
Grating 32 is subsequently formed on the upper layer, 
which needs to be well aligned laterally with the lower 
layer. There may be one or more layers 31 between grat- 
10 ings 30 and 32. The upper and lower layers may overlap in, 
the vertical direction z due to a lack of planarity in 
the layer manufacture. The layers 31 are transparent or 
partially transparent to light, at least in part of the 
wavelength spectrum detected by the optical instrument. 
15 Referring to Figure 4, the test patterns 10 and 

20 are nieasured by an optical instrument 40, preferably 
sequentially. The optical instrument 40 can be virtually 
any optical instrument that illuminates the sample and 
records at least one property of light that has inter- 
20 acted with the san^le. The instrument preferably oper- 
ates in reflection mode. Brhbodiments include 
reflectometers and ellipsometers, which are well known in 
the art. A ref lactometer measiires some function of the 
intensity of light reflected from the sample. In a pre- 
25 f erred Embodiment, the optical instrument measures spec- 
tral ref lectance R. Stanke et al. give a conplete de- 
scription of such an optical instrximent in U.S. patent 
application no. 09/533,613, Apparatus for Imaging- Metrol- 
ogYi which is incorporated herein by reference. 
30 There are many other instruments described in 

the literature suitable for alternative embodiments. An 
ellipsometer measures some .function of the contplex ratio 
rp/rs of the coinplex reflection coefficients for the P and 
S polarizations. Piwonka-Corle et al. describe in detail 
35 a suitable ellipsometer for practicing the current method 
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in U.S. Patent No. 5,608,526, Focused Beam Spectroscopic 
tliipsometry Method and System, which is incorporated 
herein by reference. Other el lipsometers could also be 
used. The optical electric field is parallel and perpen- 
dicular to the plane of incidence for the P and S x>olar- 
izations, respectively. Typically ellipsometers report 
the ellipsometric parameters and A wherein 
rp/rgsstanCFje^*. Other parameterize tions of the results 
from ellipsometxy are possible. For example the rota- 
tional Fourier coefficients of intensity measured by a 
rotating-conpensator ellipsometer, as discussed in 
"Broadband spectral operation of a rotating-^compensator 
ellipsometer", by Qpsal et al.. Thin Solid Films, 313-314 
(1998), 58-61. 

In all embodiments « measurements are made as 
functions of one or more Independent optical variables. 
Independent optical variable^ can include the wavelength 
Ar, polar angles G, azinuthal angles ^ and polarization 
states, for incident and scattered light. Different 
embodiments may include any combination of the properties 
of incident and detected lights similar to those dis- 
cussed above, at einy conibination and range of the inde- 
pendent optical variables X, 6, The preferred embodi- 
ment for integration in process tools uses wavelength X 
as the independent variable. 

Various transformations of the above mentioned 
independent variables may serve as an independent vari- 
able. In a sin{)le case, wavenumber may be used instead 
of wavelength. In another case, each ''wavelength^ may 
actually consist of a combination of many wavelengths, 
e;g., due to the finite resolution of the instrument. 
Other more complex transformations are also possible. 

The preferred optical instrument contains a 
broadband light source 42 and a spectroscopic detector 
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44. The vraivelength spectrum of light source 42 and the 
spectral sensitivity of detector 44 overlap substan- 
tially. The spot 46 of optical instrximent 40 is prefera- 
bly coni)letely contained in the gratings 10 and 20 1 one 
5 at a time. Alternatively, the spot may be sensitive to a 
regibn on the wafer that, contains other zones, e.g., a 
2 one surrounding an overlay pat tern # and the data inter- 
preted accordingly, e.g., with the method described in 
U.S. patent application no. 09/735,286 or in U.S. Patent 

10 No. 6,100,98S- The size of spot 46 is preferably many, 
times the grating period. The meastirement is substan- 
tially insensitive to lateral shift or vibration of the 
sainple, especially when spot 46 is contained in one of 
the test patterns. In a preferred embodiment, the' diame- 

is ter of the spot is typically 40 jim, gratings 10 and 20 

are 80 (Wi by 80 jim each, the pitches of all the gratings 
are 0.5 - 1.0 jiiti (with 1.0 fim being preferred), and the 
wavelength interval is 250 nm to 800 nm. The preferred 
angles of incidence and detection are substantially at 9 

20 - 0, with the illumination NA equal to 0.14 and detection 
NA equal to 0.07. For such a ''noznal incidence' instru- 
ment, the angle is preferably indetenhinate . The in- 
vention is not limited to these particular optical param- 
eters • 

25 The optical measurement does not rely on imag- 

ing or scanning the patterns 10 and 20. The detector 44 
need not have pixels that correspond to different posi- 
tions on the wafer. The measurement is ideally independ- 
ent of the position of spot 46, especially when the spot 

30 is coii^)letely contained within grating area 10 or 20. 
. Even if the spot is not contained within the grating 
area, the sensitivity to precise placement of the spot 
with respect to the grating is weak and does not preclude 
a useful measurement of overlay. 
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Because the diffraction gratingf 30 is contained 
in the lower or earlier formed layer and the diffraction 
grating 32 ia contained in the iipper or later formed 
layer ^ the position of grating 32 relative to grating 30 
depends on the alignment offset of the two layers. The 
way the Bragg orders interfere depends on the amount of 
the lateral offset between the two gratings. Hence, the 
observed reflectance from the test pattern 10 depends on 
independent variables (e.g., wavelength) and the overlay 
error of the two layers along the x-axis. Overlay error 
can be deduced from the characterization of reflected 
light as a function of independent variable (s), as de- 
scribed below. Similarly, the reflectance from grating 
pattern 20 depends on the overlay error of the two layers 
along the y-axis. In the preferred entbodiment, the de- 
tector 44 performs a measurement on the 0-th firagg order, 
i.e., Oi = 9d, although the invention is not specifically 
limited to detecting the 0-th order. 

O^he measurement depends on optical interaction 
of the two gratings. The gratings interact through Bragg 
orders. Some Bragg orders are propagating, and some are 
evanescent or non-propagating. Depending on the degree 
of evanescence and the distance between the two gratings, 
evanescent orders may contribute ' to this interaction. 
However, in the preferred embodiment, at least two orders 
are propagating in region 31 between the two gratings. 
Generally, the zeroth order will be propagating. This 
will always be the case if the refractive index (indices) 
of the material (s) between gratings 30 and 32 are greater 
than or equal to the refractive index of the medixim that 
contains -the device under test, or. wafer. In order for a 
(positive or negative) first order to be propagating in 
the region between the two gratings: 

f 2;rm 2;r . ^ Y f 2m^ \ 
+— sm^; < — form=-H or /n=-l 



wo 02/065545 



PCT/US02/D4190 



-23- 



in cases v^re the iitiaginary part of the refractive index 
n is zero or negligible. For normal incidence^ we have: 

n 

5 In the equations above, n is the refractive index of 

layers 31 between the two gratings 30 and 32. If there 
are several layers 31, n is the refractive, index of the 
least refractive layer. If the largest wavelength in the 
spectroscopic measurement is 790 im, the transparent 

10 medivun between the two gratings is SiOai and the measure- 
ment instrument . operates at normal incidence 
(qz = qo 6 0), then the pitch is preferably no less than 
541 nm. Otherwise, at least some of the spectrum will be 
insensitive to the overlay. 

15 When the layers between this gratings are lossy, 

and the refractive index n has an imaginary part, all the 
orders are attenuated to some extent as they propagate 
through the lossy medi\im. However, in practice, a first 
order will give the desired interaction as long as the 

20 attentiation ratio through all intervening layers of 
thickness t 



exp 



is small compared to 1. denotes imaginary part of 

the complex variable u. 
25 " In order to describe parts of the invention, it 
is useful to introduce an indicator offset and a coverage 
function of the indicator offset which is not an essen- 
tial part of the invention. The following discussion 
concentrates on finding one conponent of overlay, x for 
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example. The sdne would apply to the second component in 
the direction y. Figure 8a shows one period P of a grat- 
ing pair comprising lower grating 81 and upper grating 83 
with zeto offset Do - 0 between the left edge of line 85 
in lower grating 81 and the left edge of line 87 in upper 
grating 83. Left and right are used to distinguish the 
negative and positive directions along the axis under 
discussion. Por this example, the upper and lower grat- 
ings have the same pitch and the same linewidth. Figures 
8b and 8c show different values of the indicator offset 
a*id pj. In Figure 8c it is apparent that the upper grat^ 
ing is periodic, as the portion of upper line 87a has 

entered, period P frbm the left and some of portion 87b 
has exited P. due to indicator offset D,. ifce lower ' 
grating is also periodic, although it is not apparent in 
the figure. 

Figure 8d shows the coverage function for this 

grating pair, the relative proportion of lower line 85 
covered by upper line 87. A value of unity for the indi- 
cator function indicates that the upper line covers all 
of the lower line. 

For this particular grating pair, an optical 
system that has substantial left/right syntaetry, cannot 
distinguish offsets D and -D. This will be true for many 
optical systems, e.g., one operating at normal incidence, 
and others as well. This will also be true for many 
grating pairs, especially when the individual gratings 
have left/right symiiietry. in these cases the system can 
at best uniquely resolve offsets over a range of half a 
period, i.e., -0 < D < P/2. m order to allow similar 
ranges of negative and positive overlay error, the grat- 
ing pair is preferably designed so that D s ± p/4 for 
perfect overlay. Referring to Figure 6, in order to 
distiftguish overlay in the +x and -x directions, the 
gratings 30 and 32 are preferably offset with respect to 
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each other when the two layers haye perfect (zero) over- 
lay. In the preferred implementation, gratings 30 and 32 
are offset by a quarter period at perfect overlay. 

Figures 10a to lOd show exai^ples of theoreti- 
5 cally calculated reflectances for various overlays of the 
gratings in Figure 6 that demonstrate the ability to 
distinguish positive and negative overlay. Figure 11 
show3 that a sioaller pitch gives greater sensitivity to 
overlay as long as the first Bragg order is propagating. 
10 Figure 9 shows the configuration and the dimensions of 

the gratings used in the numerical example shown in f'ig- 
ures lOa-lOd and 11 . The two gratings are designated to 
be offset from each other by a quarter period when the 
two layers are perfectly registered. 
15 It is advantageous to use a grating pair with 

at least one asymmetric grating. As discussed above, 
symmetric gratings with an optical system that does not 
distinguish left and right gives a maximum range of unam- 
biguous offsets of plus and minus one quarter of the 
20 pitch. For many optical systems* including the preferred 
embodiment, the gratings' optical characteristics may be 
the only 'reference' to distinguish left from right. 
Figure 12 shows a preferred embodiment of a grating pair 
with two asymmetric gratings. Here the asymmetry refers to 
25 the different widths and spacing of the grating lines, 

rather than an asyzraaetry in the profile of the individual 
lines of a grating. Both lower grating 120 and upper 
grating 122 have the same pitch P. The pitch P may be 
nominally 1 micron. Both gratings have narrow lines 123, 
30 narrow spaces 124, wide lines 125 and wide spaces 126 in 
one unit cell, i.e., one pitch P. The narrow lines and 
spaces may be all nominally 160 nm wide. The wide lines 
and spaces may be all nominally 340 nm wide. Lower grat- 
ing 120 has polysilicon lines separated by oxide spaces 
35 and may be nominally 93 nm thick (or high) • Upper x^grat- 
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ing 122 may have nominally 380 nm high photoresist lines 
with air spaces. Lower grating 120 rests on gate oxidfe 
115 which in turn lies upon silicon substrate 110, 
Interlayer dielectric 121 is typically a silicon dioxide 
preparation such as TBOS or BPSQ. Other dimensions and 
materials could be used. 

While the preferred embodiment refers explic- 
itly to polysilicon structures in the lower grating, as 
are currently used for gates) many other structures are 
possible, e.g., for isolation trenches or metal lines 
embedded in interlayer dielectric, as is well known in 
the art. Also, the upper grating in the preferred em- 
bodiment contains photoresist, but alternative embodi- 
ments may have alternative structures, like etched struc- 
tures . 

Figure 13a shows grating pair 130 with small 
offset Do of the upper grating to the right with respect 
to the lower grating. Figure 13b shows grating pair 135 
with its upper grating having a shift to the left with 
respect to the lower. These are shifted versions of the 
grating pair in Fig\ire 12, which shows the preferred 
shift (between upper and lower gratings) for perfect 
overlay. The upper and lower gratings in that figure are 
aligned, which would render small positive and negative 
overlay errors ambiguous if the gratings were symmetric, 
as discussed above, for an Optical* system without 
^left/right sensitivity. However, close examination of 
Figures 13a and 13b, and simple heuristic arguments show 
that anibiguity is not necessarily the case for this pre- 
ferred embodiment. For example, the left edge of lower 
narrow line 132 lies directly below upper wide space 133. 
This is a distinctly different configuration than in 
Figure 13b were the right edge of lower wide line 137 is 
directly below upper wide space 138. Therefore, the 
optical response characteristics for small left and right 
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shifts are distinguishable, and indeed for any shifts 
irtddulo one period. The prefeinred embodiment with two 
asyinmetric gratings has them perfectly aligned Ciri* 
phase*, spatially) for perfect overlay. Alternative 
5 embodiments have oth^ alignments between the upper and 
lower gratings tor perfect overlay. 

Figure 14 shows the calculated spectral 
reflectance at normal Incidence for the structure iri 
Figure 12 at pierf^t overlay alignment. The calculations 

10 in this excixnple are based on the nominal preferred dimen- 
sions shown in Figure 12. Figures 15a through 15k show 
the change in the calculated spectral reflectance from 
that of perfect overlay in Figure 14 for overlay errors 
of ±lnm, ±2nm, iSnm, ±10nm, ±20nm, ±50nm, ±100nm« ±200nm, 

15 ±300nm, ±400xim, and l500n^, respectively. The graphs 
show the ability to distinguish positive and negative 
overlay error up to, but not including, overlay errors of 
one-half of the grating pitch. Figure 15k shows that for 
a pitch of lOOOnm, the results of +500nm and -SOOnm over- 

20 lay are indistinguishable. Figure 16 shows the linear 
estimate of overlay as a function of actual overlay. 

The preferred method of introducing asymmetry 
into the gratings is to use multiple lines and spaces in 
the gratings per period as discussed above. The advan- 

^5 tage is that the desired asymmetry is likely to stay 

intact regardless of process parameters. However, there 
are and will be many other methods to introduce asymmetry 
into the griatings used for overlay measurement. This is 
especially true for advanced and future processes. For 

30 exan^le, some micro-machining techniques use gross under- r 
"cut, and the asymmetry can be introduced in the undercut. 
Alternatively, effective asymmetry can be introduced by 
intentional ''imperfections'. For example, in Figure 17, 
grating 170 is made of features 172 that are nominally 

35 lines, but they have asymmetric features 174 that break 
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the reflection syiiimetry in lattice dimension x. The 
optical model for the structure 172 might approximate it 
as a one-dimensional grating, with some '^perturbatioii* on 
one edge. Offsetting individual lines by different 
amounts in y could improve the validity of such an ap- 
proximation. The averaging of the optical system along 
the invariant direction would support such approximation. 
Alternatively, asymmetry may be introduced not in the 
patterns for the structures, but by )cnown process charac- 
teristics. For example, CMP currently is known to intro- 
duce asymmetry in gratings. Controlling (or knowing) 
this asymmetry locally can give the desired asymmetry to 
the overlay metrology structure, to resolve the ambigu- 
ities associated with offset by half a period. 

Referring again to Figure 4, a camera 48 and 
image recognition software may be used to position siK>t 
46 so that it is contained in diffraction grating 10 and 
20, one at a time. (Note that the schematic drawing is 
not to the preferred scale, e.g., the spot preferably 
senses many periods of the gratings.) Either the optics 
of instrument 40, the stage that holds the wafer or both 
are movable. A con5>uter code assesses the relative. posi- 
tion of the wafer and optics based on the image from 
camera 48 and translates the wafer and/or the optics 
until the desired alignment is achieved. The tolerance 
of this alignment is large, on the order of 1 to 10 mi- 
crometers, i.e. greater than the desired overlay preci- 
sion. The tolerance need not be comparable to the de- 
sired accuracy or repeatability of the overlay measure- 
ment. Camera 48 is used only to find the measurement 
site. It does not contribute to._the_data that is used to 
measure the overlay error with high precision. . However, 
camera 48 can be used to measure gross overlay errors 
that exceed plus or minus half the period of the diffrac- 
tion gratings 120 and 122 (Figure 12). The of f set mea- 
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sured using the test patterns 10 and 20 is uncertain up 
^- to an integer multiple o£ grating i>eriods, if the upper 
and lower gratinjgs 120 and 122 are substantially asymmet- 
ric. For syxraaetric gratings^ e.g.^ 30 and 32 in figure 
5 6, the offset is uiicettain tsp.to an integer multiple of 
half grating periods. Any low- resolution overlay error 
measur^ent could be used to resolve this anibiguity. 
This uncertainty is preferably removed by using camera 48 
and a conventional box-in-box or bar-in-bar pattern in 

10 addition to test patterns 10 and 20. 

Alternatively, x-uncertainty in the overlay 
measurement along the x-axis can be reduced by providing 
two test, structures 10a and 10b, each similar to test 
structure 10, but having different grating periods. The 

15 ratio of the periods is preferably an irrational nusnbetg 

Ji 

for example The same approach can be used in the y 

direction, e.g., with two test structures 20a and 20b in 
place of structure 20 to measure the offset along the y- 
axis . 

20 Referring again to Figures 6 and 7, in addition 

to the overlay error and the wavelength, the diffraction 
characteristics euid optical response of the test struc- 
tures depend on the geometric and material properties of 
gratings 30 and 32, intermediate layers 31, and substrate 

25 or underlying layers 29. Overlay metrology requires the 
knowledge of these parameters. Material properties are 
preferably obtained by performing ellipsometric measure- 
ments on films of these materials deposited on well char- 
acterized substrates such as silicon wafers as a separate 

30 step to actually measuring overlay . error . 

The geometric parameters of the gratings and 
the films are preferably obtained from the spectroscopic 
data by regression, e.g., fitting a model to the data by 
nonlinear least squares. Referring, for example, to 
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Figure 6, the niodel for . interaction of light with the two 
gratings preferably allows e^licitly for the volume 
nature of the grating, and for boundaries between materi- 
als of differing properties in at least two dimensions . 
Thus the model allows esqplicitly for variations in at 
least two dimensions. The preferred model is rigorous 
coupled wave analysis, similar to the models eiUployed in 
patents 5,963,329 and 5,867, 276, Alternative models for 
electromagnetic scattering from a volume include, e.g., 
the finite element method, the boundary integral method. 
Green's function formulations of scattering from volumes, 
etc. Such models account for dif fraction from all bound- 
aries in the grating volume. When treated with rigorous 
coupled wave analysis, multiple interactions between the 
two gratings, via their respective diffracted orders, are 
e^qplicitly modeled. While a method like the finite ele- 
ment model does not use the same formulation, it can 
accurately account for the same effects. Well taown 
thin-film models, which are essentially one dimensional 
in nature, cannot fully account for the diffraction that 
takes place. 

Figure 18 shows a parameterization for the 
preferred model of overlay and line profiles of two dif- 
fraction gratings 30 and 32. Parameters Xo, x^, 
describe the two grating lines and their offset (X4) . In 
this way, calculating the optical response of the over- 
lapping gratings on a sair^le can take into account the 
profiles of the grating structures, including asymmetries 
caused by manufacturing processes. One embodiment of a 
nonlinear least squares fit operation, as shown in Figure 
19, determines (i.e., estimates) these.. unknown parame- 
ters. In this exazqple, the asVnanetry of grating line 32 
is accounted by the two independent parameters X2 and x,. 
In Figure 19, reflectometry or ellipsometry measurements 
as a fxinction of one or more independent variables (wave* 
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length 1« incidence or collection angle q, incidence or 
collection azinnitli f« etc.) are performed 19 1-^ An opti- 
cal response £or a specified set of overlay and profile 
parameters is calculated 192 and compared 193 with the 
5 in:easurements . The parameters are continually changed 194 
in order to minimize the difference bet%ra»en the calcu- 
lated response and the measurements. Once a best match 
is found 193 « the overlay (and optionally, the CD and 
profile) is reported 195. 
10 Many estimation methods and variations are 

suitable. E.g., theoretical speictral models correspond- 
ing to various alignment offsets and grating parameters 
can be pre- confuted and saved in a library. Hhe align- 
ment offset as well as gifating parameters can be obtained 
15 by finding the model in the library that matches the 
measured spectrum most closely. This approach uses a 
single grating pair 30 and 32 to determine a single com- 
ponent of offset error. It is preferred, e.g., over the 
method using a pair of grating pairs, described below, to 
20 keep the 'real estate' on the wafer required for test 

patterns to a minimum. A flow diagram of one such algo- 
rithm is shown in Figure 20. A database or library of 
optical responises is pre-coi^puted 200 for overlapping 
grating structures with several values of overlay and 
25 profile parameters. Then, as before, reflectometry or 

ellipsometry measurements are performed 201 on a saxnple's 
test pattern. The values stored in the library are used 
to calculate 202 a theoretical optical response, which is 
conpared 203 with the measured response. The values in 
30 the library may optionally be the desired theoretical 

optical response, -quantities used to facilitate the cal- 
culation of such a response. Parameters are changed 204 
and updated theoretical responses are calculated 202 
using the library until a *best' match is found 203. The 
35 overlay (and optionally the CD and profile parameters) 
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are th&i reported 205. In a further refinement, the 
response of the overlapping gratings can be obtained at 
measurement time by interpolating between discrete en- 
tries in the database. 

In other embodiments, samples of either. one or 
the other of the two overlying gratings used to measure 
overlay error is available without its mate, on some 
portion of the wafer. The method adds one or more steps 
for measuring the optical characteristics of single grat- 
ings (as opposed to Overlying pairs), and possibly for 
measuring parameters of single gratings, to constrain the 
measurement of overlay error on the pair of gratings. In 
some cases this may involve storing the optical response 
characteristics from a previous process step in the fab- 
rication of the wafer, e.g.. for the lower grating in the 
pair of gratings. 

An alternative, preferred embodiment of the 
method that is less sensitive to wafer- to- wafer varia- 
tions in the geometric and material properties of the 
test structures uses, for the x direction, two gratings 
as shown in cross section in Figure 21. In this ap- 
proach, two spectroscopic measurenents, one on test 
structure 210a. and another one on test structure 210b 
that is adjacent to 210a, yield offset along the x-axis. 
as disciissed in detail below. The same approach is pref- 
erably applied to another direction, e.g., along the y- 
axis. Gratings 212a and 212b are mirror images of each 
other. Similarly, Gratings 214a and 214b are mirror 
images of each other. At least one of the gratings 212a 
and 214a in test pattern 210a iare asymmetric. Similarly, 
at least one of the gratings 212b and 214b in test pat- 
tern 2i0b are asymmetric. There are two similar struc- 
tures, not shown in Figure 21, with the lattice dimension 
in the y-direction. to measure the offset along the y- 
axis. The geometric and material properties of test 
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structures 210a and 210b are stibstantially similar be- 
cause the two test structures are located close to each 
other and the same process steps produce them. 

At perfect overlay, grating 214a is offset from 
5 grating 212a along the x-axis by *Do, and grating 214b is 
offset from grating 212b by -fDo along the x-axis. Hence, 
they are mirror images. Viewed by un-polarized 
reflectometry at normal incidence, e.g., by the preferred 
instrument, the test structures 210a and 210b have the 

10 same reflectance by synnnetry. As the overlay error in- 
creases, the reflectance of the test structures 210a and 
210b change differently. The differehce of the 
reflectance spectra from 210a and 210b is indicative of 
the offset between the two layers. The difference is 

15 zero at perfect alignment eveA if the grating properties 
change from wafer to wafer or within the wafer, as long 
as they are the same for the two neighboring sttuctures. 
The difference in the spectral reflectance of gratings 
210a and 210b is proportional to overlay error D for 

20 small (on the order of 0.1 mm) overlay errors: 

/?;fc(A.A)-/^o,(A.A) «2— (A)A 
oA 

A 

The maximum likelihood estimate A of overlay error assum- 
ing the above mathematical model and random zero-mean 
Gaussian noise is: 



25 A= 



dA J 
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This is one of the many possible linear estimatoi-s of 
overlay error. Another one, for example, is" the average 

of the spectral difference ^cfcC'^.^-^^wC^.A) . Any linear 
functional of the spectral difference will be propor- 
tional to the alignment offset for small offsets. Once 
the proportionality constant is known, small offsets are 
rapidly calculated at measurement time. This eliminates 
the need for inverse diffraction calculations or searches 
in a pre-computed library. The proportionality constant 
between the norm of the spectral difference and the 
alignment offset is preferably determined by solving 
Maxwell's equations on a theoretical model of the test 
structure before the measurements. Alternatively, the 
proportionality constant can be determined enpirically. 
Or, the proportionality constant itself can be a function 
of some other measured parameter or parameters on the 
wafer, e.g.. a critical dimension, a layer thicJcness. or 
an optical property. Alternatively, the function relat- 
ing the measure of the spectral difference may be a more 
complex function of overlay error, e.g.. a polynomial or 
some other empirical function based on theoretical model 
or controlled measurements. Alternatively, the data 

measured at 210a and 210b. R,^a,l!^A<^M , are inverted 
for the overlay error simultaneously, with an algorithm 
similar to that described in conjunction with Figure 20. 
This inversion can be more stable or more efficient than 
for an inversion of either or both gratings alone, since 
it effectively removes or de-emphasizes inversion parame- 
ters other than overlay error. 

The embodiments described above for pairs of 
anti-symmetric gratings pairs (at zero overlay) use 
reflectances at multiple wavelengths as the optical char- 
acteristics. Similar arrangements of gratings can be 
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used with other optical characteriBtics and/ or measure- 
scent instmments in yet alternative enibodinents to mea- 
sure overlay with reduced sensitivity to ancillary pro- 
cess parameters. E.g.# an ellipsometer can measure the 
5 optical characteristics o£ the pair of grating pairs to 
be compared. Both grating pairs will be affected in 
substantially the same manner by ancillary changes^ yet 
will be affected in opposite ways by the offsets associ- 
ated with overlay error. 
10 Alternatively, instead of using separate line 

gratings 10 and 20 to measxire the x €Uid y components of 
the overlay error, a two-dimensional -grating 220 may be 
used as sho%m in Figure 22 to obtain both x and y compo- 
nents of the offset simultaneously. In the preferred 
is embodiment, at least one of the upper and lower gratings 
is asymmetric in both x emd y directions, as shown in 
Figure 22. Furthermore, the pattern is different in x 
and y directions; i.e., the pattern is not self similar 
under ±90° rotations in the plane of the wafer. In one 
20 preferred embodiment, as shown in Fig\ire 23, there are 

three gratings, an original 230a, one 230b mirrored in x, 
and one 230c mirrored in y, to reduce sensitivity to 
parameters other than overlay error. Alternatively, use 
of a single t%i70- dimensional grating is possible, offering 
25 less need for real estate on the wafer 

In alternative embodiments the data contains at 
least one spectroscopic meastirement that is not at normal 
incidence, i.e., q ^ 0, to assist in distinguishing the 
two dimensions. In this case the rotation of the wafer 
30 with respect to the optical system should be controlled 
so that f is controlled. 

Figure 24 shows a processing tool 240. The 
tool comprises at least one port 242 for loading saniples 
to be processed, at least one robot 244 for transporting 
35 samples within the tool, at least one process module 246 
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for actually applying a manufacturing process to a sam- 
ple!, and an optical instrument 40, as described above 
with Figure 4, Tbe process module may be a lithograpl^ 
stepper for eseposiiig Ediotoresist on a wafer, a developer 
for developing photoresist, a bake or cool plate, a spin- 
ner, an etch chamber, a deposition chamber, or any other 
processing Cool known in the art. in the preferred em- 
bodiment processing tool 240 is a lithography track with 
a stepper, and process module 246 is a photoresist devel- 
oper. 

Samples to be processed are loaded into port 
242, and passed by robot 244 to the process module for 
processing. After the processing is done, robot passes 
the sample to optical apparatus 40, which measures at 
least the overlay error of the developed film relative to 
an underlying filih. If the overlay is acceptable, the 
sample is returned to port 242 (or another one like it) , 
possibly after other manufacturing steps, if the overlay 
is deemed unacceptable, preferably action is taken to 
correct the error on the measured wafer, i.e., the 
photoresist is stripped and the wafer is reprocessed with 
adjusted process parameters. Alternatively, action is 
taken to prevent or reduce such errors oh future sazqples. 

Figure 12 shows the preferred embodiment of the 
method where the top grating 122 is composed of developed 
photoresist on top of TEOS layer 121 which will be etched 
in a following process step. The method alternatively 
can be applied when the top layer is resist that has been 
exposed by the lithography tool but not yet developed. 
Thus the top grating 122 would be a so-called latent 
image in the ej^osed photoresist. The latent image com- 
prises variations in the optical properties between ex- 
posed and unexposed regions of the resist, and/or topogr 
raphy in the top surface due to differential shrinkage 
due to eacposure. In many cases, the optical character- 



wo 02/065545 



PCT/IJS02/04190 



ization is preferably performed after a bake process, 
e.g., for so-called chemically amplified resists* The 
advantage of. using the latent image as the top grating is 
. that errors can be discovered sooner, less process time 
5 wasted and possibly fewer samples produced with such 
errors. However, the latent image does not scatter as 
strongly as the developed resist. 

In additional embodiments, the top grating 122 
miay comprise an etched pattern, for example, the upper 

10 surface of TEOS layer 121 of Figure 12 after etching, in 
these cases « the photoresist may or may not still be 
present, and there may or may not be deposits on the side 
walls of the etched trenches 124 and 126. These addi- 
tional components are typically removed by ashing and/or 

15 wet cleaning after the etch process. It is advantageous 
from the timing point of view to measure the overlay 
error before these are removed,, however, it is easier . 
from a itVodeling point of view to do it afterwards. 

In yet additional embodiments, as shown in 

20 Figure 25, region 252 separating lower grating 254 and. 

upper grating 256 raiay con^rise optically lossy materials, 
so that little or nd optical energy passes between the 
two gratings. Such situations may arise in microelec- 
tronics manufacture when patterning the intervening mate- 

25 rial 256 to form poly-silicon gates or Damascene metal 

interconnects. In such cases, ancillary physical proper- 
ties, such as the topography of surface 258 due to the 
presence of underlying grating 254, provides sufficient 
modification of the optical characteristics to allow 

30 measurement of overlay with the same general method, If 
a theoretical model is used to invert the data, it would 
coiiiprise, .for example, the loss in region 252, the topog- 
raphy of surface 258, and the offset between that topog- 
raphy and grating 256. 
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The above descriptions refer to gratings. 
Periodic » laterally Cartesian gratings are preferred at 
the present time due to speed limitations. o£ conputa- 
tional methods and hardware for the scattering from the 
structures. However, the above methods are also applica- 
ble to more general scattering structures which may be 
more practical when models to describe their scattering 
become available, ihae the above methods apply to non* 
periodic 'gratings', e.g., variable pitch gratings and 
'siiigle-period gratings', non-Cartesian gra tinges (e.g., 
generally circular gratings), and the like. Also, the 
above descriptions iiciplied that the upper and lower grat- 
ings have the same pitch(es) and orientation. However, 
the methods are applicable to cases where the upper and 
lower gratings have different pitches and/or different 
orientations. For example, as computational hardware and 
methods advance, overlay error may be measured directly 
on the "device structures' on the wafer, without using 
specially designed test structures that are typically 
built in otherwise ''wasted'' regions, e.g., scribe lines. 
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Claims 

1. A method of measuring alignment accuracy between two 
or more patterned layers fotmed on a sxibstrate 
5 coinpriaing: 

forming test areias as part of the 
patterned layers, wherein a first diffraction grating is 
built into a patterned layer A and a second diffraction 
grating is built into a patterned layer B, where layers A 

10 aind B are desired to be aligned with respect €o each 

other, zero or more layers of other materials separating 
layers A and the two jgratings substantially 
overlapping when viewed from a direction that is 
perpendiculaiic to the surfaces of A and B; 

15 observing the overlaid diffraction gratings 

usdLng an optical instrument capable of measuring any one 
or more of transmission, reflectance, or ellipsometric 
peirameters as a function of any one or more of 
wavelength, polar angle of incidence, azimuthal angle of 

20 incidence, or polarization of the illumination cuid 
de tec t ion ; and 

determining the offset between the gratings 
from the measurements from the optical instrument using 
an optical mod^l, wherein the optical model accounts for 

25 the diffraction of the electromagnetic waves by the 

gratings and the interaction of the gratings with each 
other's diffracted field. 

30 2 . The method of claim 1 wherein any layers between the 
grating in layer A and the grating in layer B are at 
least partially transparent at the wavelength range of 
the optical instrument. 
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3 . The method of claim i wherein at least one layer 
between the grating in layer A and the grating in layer B 
is opaque in the wavelength range of the optical 
instrument, aiid the presence of the grating in layer A 
causes a gi?a ting- shaped topdgraphy on the surface of the 
opaque layesr. 



4. The method of claim 1 wherein the optical model 
represents the electromagnetic field in the gratings arid 
in the layers between the gratings as a sum of more than 
one diffracted orders. 



5. The method of claim 1 wherein offset is determined 
by; 

calculating, according to a model of a wafer 
saiqple, the optical response of the sample with the said 
two overlapping gratings, the model of the sanple taking 
into account parameters of the sample including any of 
the overlay misalignment of layers A and B, the profiles 
of the grating structures, and asymmetries caused in the 
grating structures by manufacturing processes; 

changing the parameters of the sai^ple model to 
minimize the difference between the calculated and 
measured optical responses; and 

repeating the previous two steps until the 
difference between the calculated and measured optical 
responses is sufficiently small or cannot be 
significantly decreased by further iterations. 

6. The method of claim 5 wherein at least a portion of 
the calculation is done at the measurement time. 
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7. "the method of claim 5 wherein at least a portion of 
the calculated optical response is retrieved from a pre- 
conputed . database . 

5 

8. The method of claim 5 wherein the calculation 
involves interpolating the optical response from pre- 
con^ute'd emtries in a database. 

10 

9* The method of claim 1 wherein the first and second 
diffraction gratings have different pitches. 

15 10. The method of claim 1 wherein at least one of the 
two gratings contains more than one line per pitch « the 
widths of the at least two lines in each pitch (unit 
cell) being stabstantially different from each other. 

20 

11. A method of measuring alignment accuracy between two 
or more patterned layers formed on a substrate 
comprising: 

forming test areas as part of the patterned 
25 layers, wherein a first diffraction grating is built into 
a first patterned layer and a second diffraction grating 
is built into a second patterned layer, the two gratings 
substantially overlapping when viewed from a direction 
that is perpendicular to the surfaces of A and B, and at 
30 least one of the first or second gratings having a 

repeating pattern consisting of at least two structures 
of substantially different lateral dimensions; 
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measuring the optical characteristics of the 
overlaid diffraction gratings using an opTtical instrument 
with a spot size covering at least two repeats; and 
diBterndning the offset between the gratings from the 
measured optical characteristics. 



12. An apparatus for determining overlay error between 
two or more patterned layers of a sample, conprising/ 

a metrology target comprising a first 
diffraction grating built into a patterned layer A and a 
second diffraction grating built into a patterned layer 
B, where layers A and B are part of the sanple under test 
and layers A and B are desired to be aligned with respect 
to each other, the two gratings substantially overlapping 
when viewed from a direction that is perpendicular to the 
layers A and B; 

on optical instrument that illuminates part or 
all of the metrology target and that measures properties 
of light that has interacted with the metrology target as 
a function of any one or more of polar angle of 
incidence, azimuthal angle of incidence, and polarization 
of the illumination and detection; and 

a processor which estimates the offset of the . 
grating pair from the measured properties. 



13, The apparatus of claim 12 wherein the first and 
second diffraction gratings have different pitches. 

14. The apparatus of claim 12 wherein at least one of 
the two gratings contains more than one line per pitch, 
the widths of the at least two lines in each pitch (unit 
cell) being substantially different from each other. 



.WO02/M»S545 PCT/US02/041M 

-43- * 

15. The a£^ratu6 of claim 12 wherein at least one .other 
layer of material s^)arate8 layers A and B at tte 
irietrolbgy target. 

5 

16. The apparatus of claim 12 wherein the optical 
instrument measures properties of light that has 
interacted with the metrology target as a function of 
wavelength. 

10 

17. The apparatus of claim 12 wherein the processor has 
been programmed to iteratlvely (i) calculate an optical 
response for a set of sanqple parameters, including 

15 overlay misalignment, (ii) comg;>are the measured 

properties with the calculated optical response, and 
(iii) chanige one or more san^le parameters so as to 
minimize the difference between the measured properties 
and the calculated ojDtical response, 

20 wherein the calculation of an optical response 

is according to an optical model of the sample that 
accounts for the diffraction of electromagnetic waves by 
the pair of gratings of the metrology target and the 
interacti6n of the gratings with each other's diffracted 

25 field. 

18. The apparatus of claim 17 wherein the processor has 
access to a pre-computed database from which at least a 

30 portion of the calculated optical response can be 
retrieved. 
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19. The apparatus of claim 18 wherein the calculation 
performed by the progranoned processor involves 
interpolating the optical response from pre-cosiputed 
^tries din said database. 

20, An apparatus for determining the overlay error 
comprising, 

a metrology target comprising a first 
diffraction grating built into a patterned layer A and a 
second diffraction grating ie built into a patterned 
layer B, where layers A and B are desired to be aligned 
with respect to each other, the two gratings 
substantially overlapping when viewed from a direction 
that is perpendicular to the layers A and B; 

an ellipsometer that illuminates part or all of 
the metrology target and that measures properties of 
light that has interacted with the metrology target; and 

a processor which estimates the of f set of the 
grating pair from the pair's measured optical 
characteristics. 



21. The method of claim 20 wherein first and second 
diffraction gratings have different pitches. 

22. The apparatus of claim 20 wherein at least one of. 
the two gratings contains more than one line per pitch, 
the widths of the at least two lines in each pitch (unit 
cell) being substantially different from each other. 
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23. The apparatus of claim 20 wherein at least one other 
layer of rnaterial separates layers A and B at the 
metrology target. 

•5 • 

24. The apparatus of claim 20 wherein the eLli^someter 
measiires properties of light that has interacted with the 
metrology target as a function of wavelength. 

10 

25. T^e apparatus of claim 20 wherein the processor has 
been programmed to iteratively (i) calculate an optical 
response for a set of . sanple parameters « including 
overlay misalisfnmexit* (ii) centre the measured 

15 properties with the calculated optical response, and 
(iii) change one or more saiqple parameters so as to 
minimize the difference between the measured properties 
and the calculated optical response^ 

wherein the calculation of an optical response 

20 is according to an optical model of the saii5)le that 

accounts for the diffraction of electromagnetic waves by 
the pair of gratings of the metrology target and the 
interaction of the gratings with each other's diffracted 
field. 

25 

26. The apparatus of claim 25 wherein the processor has 
access to a pre-coit^puted database from which at least a 
portion of the calculated optical response can be 

30 retrieved. 
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27. The apparatus of claim 26 wherein the calculation 
performed by the programmed processor involves 
interpolating the optical response from pre-con^ted 
entries in said database. 
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